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Abstract. In this paper, we present the issues we consider as essential as far as the statisti- 
cal mechanics of finite systems is concerned. In particular, we emphasis our present understand- 
ing of phase transitions in the framework of information theory. Information theory provides a 
thermodynamically-consistent treatment of finite, open, transient and expanding systems which are 
difficult problems in approaches using standard statistical ensembles. As an example, we analyze 
is the problem of boundary conditions, which in the framework of information theory must also be 
treated statistically. We recall that out of the thermodynamical limit the different ensembles are not 
equivalent and in particular they may lead to dramatically different equation of states, in the region 
of a first order phase transition. We recall the recent progresses achieved in the understanding of 
first-order phase transition in finite systems: the equivalence between the Yang-Lee theorem and 
the occurrence of bimodalities in the intensive ensemble and the presence of inverted curvatures of 
the thermodynamic potential of the associated extensive ensemble. We come back to the concept of 
order parameters and to the role of constraints on order parameters in order to predict the expected 
signature of first-order phase transition: in absence of any constraint (intensive ensemble) bimobal- 
ity of the event distribution is expected while an inverted curvature of the thermodynamic potential 
is expected at a fixe value of the order parameter (extensive ensemble) in between the phases (coex- 
istence zone). We stress that this discussion is not restricted to the possible occurrence of negative 
specific heat, but can also include negative compressibility's and negative susceptibihties, and in 
fact any curvature anomaly of the thermodynamic potential. 
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INTRODUCTION: UNUSUAL MESOSCOPIC WORLDS 

Everybody knows that when a liquid is heated, its temperature increases until the mo- 
ment when it starts to boil. The increase in temperature then stops, all heat being used 
to transform the liquid into vapor. What is the microscopic origin of such a strange be- 
havior? Does a liquid drop containing only few molecules behave the same? Recent 
experimental and theoretical developments seem to indicate that at the elementary level 
of very small systems, this anomaly appears in an even more astonishing way: during the 
change of state - for example from liquid to gas - the system cools whereas it is heated, 
i.e. its temperature decreases while its energy increases. This phenomenon is only one 
example of the fact that small systems when heated or compressed do not behave like the 
macroscopic systems we are used to. This paper presents a review of key issues about 
the thermodynamics of small systems in particular in presence of phase transition. 

In many different fields of physics, finite systems properties, non extensive thermo- 
dynamics, and phase transitions out of the thermodynamic limit are strongly debated 



issues(see for exaniple This may be the case of non saturating forces such as the 
gravitational ['2i Isi 3, 111] or the Coulombic forces. The system may be too small, as in 
the case of clusters and nuclei iSSS, @]. The physics of finite systems is even more 
complicated since often they are not only small but also open and transient. This imphes 
that the various concepts of thermodynamics and statistical mechanics [Hol lll. 12. 131 
have to be completed and revisited [Tlll4l.ll5l. ll6Lll7L ll8l.ll9]. 

A consistent framework to address those issues is the information theory approach to 
statistical mechanics [|l4L IT6|] . This formalism allows to address in a consistent way the 
statistical mechanics of open systems evolving in time, independent of their interaction 
range and number of constituents. After a short summary of the statistical physics 
concepts, we will first address the essential question of boundary conditions, which 
cannot be avoided when dealing with finite systems with continuum states. Recalling that 
the exact knowledge of a boundary requires an infinite information, we will show that 
a consistent treatment of unbound systems can still be achieved if boundary conditions 
are treated statistically, leading to new statistical ensembles. Then we will recall that for 
finite systems the different ensembles are not equivalent [[18, 21 , 22, 23|, 2^] In particular, 
two ensembles which put different constraints on the fluctuations of the order parameter 
lead to qualitatively different equations of state close to a first order phase transition[23|]. 

As an example, when energy is the order parameter, the microcanonical (at fixed en- 
ergy) heat capacity diverges to become negative while the canonical (at fixed tempera- 
tureione remains always positive and finite ESHSHHIlElsllssilMlls 



[f the magnetization is the order parameter, it is the magnetic susceptibility, or if 
the volume (or density) is the order parameter, it is the compressibility, or if the number 
of particle is the order parameter, it is the chemical susceptibility which are expected 
to present a negative branch in between two divergences in the fixed order parameter 
ensemble while in the ensemble in which the order parameter has only a mean value 
constrained by a Lagrange intensive parameter (magnetic field or pressure or chemical 
potential) should remain positive. This difference between ensembles can be of primor- 
dial importance for microscopic and mesoscopic systems undergoing a phase transition. 



Such systems are now studied in many fields of physics, from Bose condensates [|38l .l3 



to the quark-gluon plasma [ 40 , 41], from cluster melting [@, |42|] to nuclear fragmenta- 
tion [0]. Moreover, such inequivalences may survive at the infinite size limit for systems 
involving long range forces such as self-gravitating objects[|3il3,|5|]. 

Then, going deeper insight the formalism, we will summarize the mathematical equiv- 
alence between the Yang-Lee approach [43] through the zeroes of the partition sum in the 
space of complex intensive parameters associated with an order parameter, the bimodal- 
ity of the order parameter distribution in this intensive ensemble, and the anomalous 
(inverted) curvature of the thermodynamic potential of the ensemble where the order 
parameter is fixed [|44L |45L [46] . The best documented example in the literature is the 
bimodality of the canonical energy distribution and its equivalence to negative micro- 
canonical heat capacity [|47L l48L l49|] . As far as the time evolution problem is concerned, 
we stress the need to take into account time odd constraint in the statistical picture. 
Finally, we conclude about the important challenges related finite-systems thermody- 
namics. 



STATISTICAL MECHANICS DESCRIPTION OF FINITE SYSTEM 



When discussing tliermodynamics, people often implicitly refer to macroscopic systems. 
Indeed, when it is possible to take the thermodynamical limit of infinite systems the 
statistical physics simplifies since all statistical ensembles are equivalent. However, 
when this limit is not taken because it does not exist for the considered system (e.g. 
self-gravitating systems) or because the considered system is simply finite, one may 
think that the thermodynamics concept are becoming vague or even ill-defined. Are 
concepts like equilibrium, temperature, pressure etc. applicable to objects as tiny as 
nuclei. How large must be a system for a temperature to be defined? These questions 
originates from a confusion between a possible uncertainty about the statistical ensemble 
corresponding to the studied system and a fundamental problem on thermodynamical 
concepts. Only the former is a real issue. Indeed, when a statistical ensemble is defined 
all thermodynamical quantities are clearly mathematically defined. At the basis of this 
discussion, the concept of equilibrium is also well defined since in the Gibbs spirit it 
is nothing but the ensemble of events maximizing the entropy under the considered 
constraints. Then, a concept like the temperature is well define the problem being that 
it is not the same definition in the various ensemble, then the question is what is the 
physical meaning of thermodynamic quantities - say, temperature - evaluated through 
different ensembles? is there a '"correct"' ensemble to be used? 

This discussion becomes even stronger when phase transitions are concerned. Indeed, 
when the thermodynamical limit can be taken phase transitions are well defined in all 
text book as non-analytical properties of thermodynamical potentials. However, this 
anomalous behavior being generated when taking the limit of infinite systems this 
definition of phase transition cannot be operational for finite systems. However, we 
all know finite systems can change state or shape, a typical example being the case 
of isomerization; how many degrees of freedom do we need in order to call this change 
of state a phase transition? 

To start answering those questions about the thermodynamics concepts applied to 
finite systems we can first look at simple systems. Let us consider a system that can exist 
in two single microstates of different energy (a single spin in a magnetic field, a two-level 
atom in a bath of radiation...) The system being much smaller than its environment, let 
us consider the case for which the interaction between system and environment can be 
neglected and we have no reason to believe that the environment will be in any specific 
state. Then the distribution of the system microstates is simply given by the number of 
states of the environment 



where Ef is the total energy (system + environment) and S = logW is the (micro- 
canonical) entropy associated to the environment. Since Cn <^ Et, a Taylor expansion of 
the entropy gives 
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where we have introduce /3 = dSdE, the temperature of the environment. 

This very simple textbook exercise gives us a number of interesting information: 



• thermodynamic concepts like temperature can be defined for systems having an 
arbitrary number of degrees of freedom (the minimum being 2 levels) 

• Boltzmann-Gibbs statistics naturally emerges as soon as we observe a limited 
information constructed from a reduced number of degrees of freedom 

If we now take into account a slightly more complicated system with energy states 
associated to a degeneracy w{e), the energy distribution will be modified to 

^ wie)W{Et-e) _ w{e)exp{-l5e) 
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where the canonical approximation is still correct if the system is associated to a much 
smaller number of degrees of freedom than its environment. Eq.© gives for instance the 
energy distribution of a thermometer loosely coupled to an otherwise isolated system. 
Temperature is defined as the response of the thermometer in the most probable energy 
state e; if we maximize the distribution ([3]) we get, assuming that energy can be treated 
as a continuous variable 

dlogW . dlogw. 

We then learn that the quantity shared at the most probable energy partition is the 
microcanonical temperature. This shows that there is no ambiguity in the definition of 
temperature (and any other thermodynamic quantity) when dealing with small systems. 

It is important to note that eq.© is not limited to the observation of energy, but can 
apply to the distribution of any generic observable A = {A). We can then expect that 
canonical-like ensembles (i.e. ensembles where distributions are given by Boltzmann 
factors) will arise each time that we are isolating a small number of degrees of freedom 
from a more complex system. More generally, we will recall in the next section that a 
statistical description is in order each time that the system is complex enough to have 
a large number of microstates associated to a given set of relevant observables. If the 
relevant observables are recognized, equilibrium is therefore a very generic concept of 
minimum information theory. The proper statistical ensemble i.e. the relevant observ- 
ables depend on the dynamics of the considered system and the way it is prepared. 



INFORMATION THEORY 

Information theory is the general framework which provides the foundation of statistical 
mechanics. It should be noticed that it alsogives the framework for the generalization 



of the Density Functional Theory (DFT) (SOD. It leads to a consistent treatment of the 



thermodynamics of finite systems both in the classical and in the quantal world[[l6l [l4l] . 
Let us summarize here the essential ingredients. We will use here quantum mechanics 
notations. Classical approaches can immediately be defined as a classical limit of the 
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FIGURE 1. Illustration of the Liouville space of density matrices: an observation (A,) is a projection 
of D on the axis associated with the corresponding observable A,. 



presented results. Statistical physics treats statistical ensembles of possible solutions for 
the considered physical system. 



Liouville Space 

Such a "macro-state" can be represented by its density matrix 

(5) 

VJ/(n) 



where the states ("micro-states", or "partitions", or "replicas", or simply "events") 
pertain to the considered Fock or Hilbert space, p^"^ is the occurrence probability of the 



event 



vi/(") \ _ xhe result of the measurement of an observable A is 



<A>^=TrAD, (6) 
where Tr means the trace over the quantum Fock or Hilbert space of states { |^) }. In the 



space of Hermitian matrices, the trace provides a scalar product [|5lL |52|] 



<<A||D»=TrAD. (7) 

It is then possible to define an orthonormal basis of Hermitian operators {Oi} in the 
observables space, and to interpret the measurement < <9/ >^ as a coordinate of the 
density matrix D (see figlB- The size of the observables space is the square of the 
dimension of the Hilbert or Fock space, which are in general infinite; therefore in order 



to describe the system, one is forced to consider a reduced set of (collective) observables 
{Ag} which are supposed to contained the relevant information. 



Maximum Entropy State and Generilized Gibbs Equilibrium 

The Gibbs formulation of statistical mechanics can then be derived if the least 
biased"macro-state" is assumed to be given by the maximization of the entropy Q 

S[D] = -TrDlogD, (8) 

which is nothing but the opposite of the Shannon information [[l^ [I?] . It is important to 
notice that this formalism can be generalized introducing alternative entropies such as 
the Tsallis entropy. Eq.® is a definition of entropy valid for any density matrix which 
coincides with the standard thermodynamic entropy only after maximization, see eq. 
(fT3l) below. 

If the system is characterized by L (relevant) observables (or "extensive" variables H), 
A = {Ag}, known in average < >= TrDAg, the variation of the density matrix is not 
free. The standard way to solve this maximization of the entropy under constraints is to 
maximize with no constraints the constrained entropy 

y = 5-£A^<A^>, (9) 

e 

where the A = {A^} are L Lagrange multipliers associated with the L constraints <A£>. 

A maximization of the entropy under constraints gives a prediction for the minimum 
biased density matrix (or " 'event distribution'") which can be viewed as a generalization 
of Gibbs equilibrium: 

Dx = ^oxp-X.A, (10) 

where A. A =L^=i ^e-^e and where Zx is the associated partition sum insuring the nor- 
malization of Dx : 

Zi=Trexp—A.A (11) 
Using this definition, we can compute the associated equations of state (EoS): 

<Af>=dx,logZx. (12) 
The entropy associated with is: 



In this article we implicitly use units such that the Boltzmann constant k= I. 
^ In this paper the word "extensive" is used in the general sense of resulting from an observation, i.e. the 
<A(: >, and not in the restricted sense of additive variable. Intensive variables are conjugate to extensive 
variables i.e. Lagrange multipliers kg imposing the average value of the associated extensive variable. 



S[Dx]=logZx+Y,^e<Ae>, (13) 

e 

which has the structure of a Legendre transform between the entropy and the thermody- 
namic potential. 

Ergodicity and Microcanonical Ensemble 

To interpret the Gibbs ensemble as resulting from the contact with a reservoir or 
to guarantee the stationarity of eq. (fTOl) . it is often assumed that the observables Ai 
are conserved quantities such as the energy H, the particle (or charge) numbers A^, or 
the angular momentum L i20ll . However, there is no formal reason to limit the state 
variables to constants of motion. Even more, the introduction of not conserved quantities 
might be a way to take into account some non ergodic aspects. Indeed, an additional 
constraint reduces the entropy, limiting the populated phase space or modifying the event 
distribution. This point will be developed at length in the next sections. 

It should be noticed that microcanonical thermodynamics also corresponds to a max- 
imization of the entropy ([8]) in a fixed energy subspace. In this case the maximum of the 
Shannon entropy can be identified with the Boltzmann entropy 



max(5) =logW(£), (14) 

where W is the total state density with the energy E. The microcanonical case can also be 
seen as a particular Gibbs equilibrium (fTOl) for which both the energy and its fluctuation 
are constrained. This so called Gaussian ensemble in fact interpolates between the 
canonical and microcanonical ensemble depending upon the constrain on the energy 



fluctuation ll25L 15311. and the same procedure can be applied to any conservation law. In 
this sense the Gibbs formulation (fTOl) can be considered as the most general. 

The ensemble of extensive variables constrained exactly or in average completely de- 
fines the statistical ensemble. This means that many different ensembles can be defined, 
and the most appropriate description of a finite system may be different from the stan- 
dard microcanonical, canonical or grand-canonical. 



Problem of the Finite Size 

Moreover,the microcanonical, canonical or grand-canonical ensembles require the 
definition of boundary condition. As discussed below, this definition is an infinite infor- 
mation incompatible with the Gibbs ideas. One can rather introduce boundaries through 
additionalconstraints Ai taking advantage of the general formalism above. It should be 
noticed that the definition of statistical ensemble forwhich the boundaries are defined 
using constraints such as < > or more complex moments of the matter distribution 
can also be used for opensystems considered at a finite time as discussed in reference 
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FIGURE 2. Illustxation of three different ensembles for a finite system: a system in a cubic box with a 
fixed volume which can be seen as an isochore ensemble; a system constrained by its average size playing 
the role of a confining pressure, which can be therefore called an isobar ensemble; and a quadrupole 
deformation. 



Figure 2 illustrates this diversity of statistical ensembles concerning the size observ- 
ables of a finite system with three cases: a fixed volume with a specific shape, a size 
known only in average and a quadrupole deformation. Many other ensembles can be de- 
fined using higher multipoles as well as other observables, including time odd quantities 
such as angular momentum I = fAp or radial flow r- p. Such constraints are an explicit 
way to describe statistical ensembles associated with time evolution 1154 15511. 



Finite size and boundary conditions 

The above discussion stresses the important problem when considering finite size 
systems which is the need to define boundary conditions to define the finite size. This is 
only a mathematical detail for "condensed" systems, i.e. finite size self-bound systems in 
a much larger container, or particles trapped in an external confining potential [56]. In the 
other cases, finite size systems can only be defined when proper boundary conditions are 
specified. Conversely to the thermodynamic limit which, when it exists, clearly isolates 
bulk properties independent of the actual shape of the container, finite size systems 
explicitly depend on boundary conditions. 

From a mathematical point of view the system Hamiltonian H is not defined until 
boundary conditions are specified. For example for a particle problem a boundary can 
be the definition of a surface given by the implicit equation o{x,y,z) = 0. (see figure |3]). 
Since the Hamiltonian Ha explicitly contains the boundary, the entropy Sa also directly 
depends upon the definition of this boundary, according to 



o (x,y,z) = 



FIGURE 3. Schematic representation of the boundary condition around a nucleus computed with a 
mean-field approach as a surface defined by the implicit equation a{x,y,z) = which can be any shape. 



Sa{E)=\ogtrd{E-Ho). (15) 

The first conclusion is then that for a finite system the microcanonical ensemble is ill- 
defined and so the thermodynamic properties of finite systems directly depend upon 
the boundary conditions, i.e. the size and shape and all small details of the considered 
container. 

This brings an even more severe conceptual problem; the knowledge of the boundary 
requires an infinite information: the values of the function o defining the actual surface 
in each space point. This is easily seen introducing the projector P(y over the surface and 
its exterior. Indeed the boundary conditions applied to each microstate 



P^xi/w^^Q (16) 

is exactly equivalent to the extra constraint 

<Pa>=i:rDPa = . (17) 

If we note again A the observables characterizing a given equilibrium, the density matrix 
including the boundary condition constraints reads 

Dx^ = ^exp-X.A-bPa (18) 

which shows that the thermodynamics of the system does not only depend on the La- 
grange multiplier b, but on the whole surface. For the very same global features such 



as the same average particle density or energy, we will have as many different ther- 
modynamics as boundary conditions. More important, to specify the density matrix, the 
projector Pa has to be exactly known and this is in fact impossible in actual experiments. 

Moreover, the nature of Pa is intrinsically different from the usual global observables 
A£. At variance with the Ag, Pa is a many-body operator which does not correspond to 
any physical measurable observable. The knowledge of Pa requires the exact knowledge 
of each point of the boundary surface while no or few parameters are sufficient to 
define the Af . If we consider statistical physics as founded by the concept of minimum 
information ifTil [l6ll . it is incoherent to introduce an exact knowledge of the boundary. 

One should rather apply the minimum information concept also to the boundaries, 
for example introducing a hierarchy of collective observables which define the size 
and shape of the considered system. This amounts to introduce statistical ensembles 
treating the boundaries as additional extensive variables fixed by conjugated Lagrange 
parameters [55]. If for instance we consider that the relevant size information for an 
unbound system is its global square radius < >, the adequate constraint 

-XR^ (19) 

should be introduced in the statistical treatment while all boundary conditions should be 
removed to infinity. 

MAXIMUM ENTROPY STATES 

As we have discussed in the previous sections, a statistical treatment is best suited 
whenever a very large number of microstates exists for a given set of observables. An 
ensemble of events coming from similary prepared initial systems and/or selected by 
sorting the final states using experimental observations always constitutes a statistical 
ensemble. Indeed, if we are able to recognize all the relevant degrees of freedom (i.e. 
the observations with a strong information content) the ensemble of replicas is by 
construction a statistical ensemble, i.e. a Gibbs equilibrium in the extended sense of 
section above. Indeed, if we are able to show that the observed ensemble deviates 
from the maximum entropy state this means that at least one observable give a different 
observation in the actual state and in the maximum entropy state. Then, one should 
introduce this observable as a relevant information and modify a maximum entropy 
state. Then, iteratively, one is extracting the relevant observable space and at the end 
of the process the actual state is then identical to the maximum entropy state. Of course 
nothing is preventing that at the end of the process the whole Liouville space is needed as 
a relevant information space. Then the maximum entropy state is nothing but a complete 
description of any state and in this case the statistical mechanics tools are not useful. El 

However the whole idea behind the second principle of thermodynamics and the 
Gibbs ansatz and the information theory is that in most physical system the relevant 



^ A way out can be to introduce another entropy better suited to treat the relation between the information 
and the associated state as discussed in other contributions in this book. We will not elaborate more on 
this point here and rather refocus the discussion on the generalized Gibbs equilibria. 



information is only a small subset of the possible observations. The fundamental prob- 
lem is then the recognition of the relevant observables. A generic procedure is to make 
assumptions for the relevant information and then to compare the maximum entropy 
state prediction for other observables with the actual observations. Any important obser- 
vation signals a missing part of the information to be possibly introduced as a constraint 
correcting the observed deviation. 

The theoretical justification of this minimal statistical picture comes from the fact that 
complex classical systems subject to a non linear dynamics are generally mixing[60]. In 
such a case the statistical ensemble is created by the propagation in time of initial fluc- 
tuations. The averages are averages over the initial conditions and the mixing character 
of the dynamics (if it can be proved) insures that the initial fluctuations are amplified in 
such a way that the ensemble of events covers the whole phase space uniformly. For a 
classical dynamics which conserves the phase space volume of the ensemble of events, 
this means that the initial distribution is elongated and folded in such away that it gets 
close to any point of the phase space (the so-called baker transformation). This classical 
picture can be replaced in the quantum case by the idea of projection of irrelevant corre- 
lations (5^. The phase space can be described as a subspace of all possible observations. 
The regular quantum dynamics in the full space is transformed into a complex dynamics 
by the projection in the relevant observation sub-space. Then two different realizations 
corresponding to the same projection, i.e. the same point in the relevant space, may differ 
in the full space (and consequently in their successive evolution) because of the unob- 
served correlations. This ensemble of correlations may lead to a statistical ensemble of 
realizations after a finite time. This phenomenon is often described introducing stochas- 
tic dynamics, i.e. assuming that the unobserved part of the dynamics which is averaged 
over is a random process [I6ll.l62ll. 



Distance to equilibrium 

An important point to be discussed is the justification of the statistical description. 
As we have just mentioned, the applicability of a statistical picture is in most cases an 
hypothesis (or a principle like in the thermodynamics second law). Therefore, the equi- 
librium hypothesis should be a posteriori controlled. Different properties can provide 
tests of equilibration such as 

• the comparison with statistical models, 

• the consistency of thermodynamical quantities, namely the compatibility of the 
different intensive variables measurements (e.g. of the different thermometers) or 
the fulfillment of thermodynamic relations between averages and fluctuations (e.g. 

= d^logZx/dXf = -d <Ai> jdXi since <Ai >= -dlogZx/dXi ) 

• the memory loss or the independence of the results on the preparation method of 
the considered ensemble. 



meaning that any phase space point gets close to at least one event. 



However, it should be stressed that the real question is not whether the system is at 
equilibrium, but rather how far is it from a given equilibrium? Indeed, equilibrium is 
a theoretical abstraction which cannot be achieved in the real world H. To answer this 
question we should define a distance. The first idea could be to use the Liouville metric 

d% = ix{b-b),f (20) 

between the actual ensemble characterized by the density matrix D, and the equilibrium 
one Dx computed for the same collective variables < A( > . This is a nice theoretical 
tool, but a rather difficult definition as far as experiments are concerned. 



Another possibility is to introduce entropy as a metric [|61l] 



ds=\S[D\-S[bx]\/S[bx] (21) 

This is a way to measure how far the system is from the maximum entropy state (or 
in other words to measure how much information on the actual system is included in 
the collective variables {< >} and how much is out of the considered equilibrium 
looking at ( 1 — ^5) . This is a more physical distance but again it is difficult to implement 
in real experimental situations. 

A more practical measurement of the distance to equilibrium is to focus on the infor- 
mation used to deduce physical properties. Since the information about the actual system 
is contained in the observations <0i>, the natural space to introduce this distance is the 
observation space. This is a formally well defined problem since considering TxOiOj as 
the scalar product between observables, the observation space has a well defined topol- 
ogy. Then, when orthogonal observables are considered |^ the distance to equilibrium is 
simply 

d,= \<d,> -<di>eq\ (22) 

Of course the considered observables should not be the one used to describe the statis- 
tical equilibrium. A typical example is given by the difference between the measured 
fluctuations o^^ =<A^> — <A^>^ and the expected ones = —d <Ai> / dX^ in 
the ensemble controlled by the . 



FINITE SYSTEMS AND ENSEMBLE INEQUIVALENCE 

The Van Hove theorem[63] is a fundamental theorem in statistical mechanics, since 
it guarantees the equivalence between different statistical ensembles at the thermody- 
namic limit of systems presenting only finite range interactions. In finite systems, this 
equivalence can be strongly violated in particular in first order phase transitions re- 
gions. This violation can persist up to the thermodynamic limit in the case of long 
range forces. The non-equivalence of statistical ensembles has important conceptual 



For example, the Boltzmann law should be an exponential decay up to an infinite energy. 
^ if observables are not orthogonal it is alway possible to use a Schmitt procedure to define a set of 
orthogonal observables llT6ll . 



consequences. It implies that the value of thermodynamic variables for the very same 
system depends on the type of experiment which is performed (i.e. on the ensemble 
of constraints which are put on the system), contrary to the standard thermodynamic 
viewpoint that water heated in a kettle is the same as water put in an oven at the same 
temperature. Ensemble inequivalence is the subject of an abundant literature (see for 



example ref s. 11231. 1241 126L 1271 128L I29L I30L '31] for a discussion in a general context, and 
refs.[32, 33,Q^35, 361 37ll concerning phase transitions ). 

Generally speaking, for a given value of the control parameters (or intensive vari- 
ables) Xf , the properties of a substance are univocally defined, i.e. the conjugated exten- 
sive variables <Ai> have a unique value unambiguously defined by the corresponding 
equation of state 

<A^>=-dx,\ogZ{{X(} (23) 

In reality, this fixes only the average value and the event by event value of the observation 
of produces a probability distribution. The intuitive expectation that extensive vari- 
ables at equilibrium have a unique value therefore means that the probability distribution 
is narrow and normal, such that a good approximation can be obtained by replacing the 
distribution with its most probable value. The normality of probability distributions is 
usually assumed on the basis of the central limit theorem. However, in finite systems 
the probability distributions has a finite width and moreover it can depart from a normal 
distribution. We will discuss in particular the case of a bimodal distribution [44]: in this 
case two different properties (phases) coexist for the same value of the intensive control 
variable. 

The topological anomalies of probability distributions and the failure of the central 
limit theorem in phase coexistence imply that in a first order phase transition the different 
statistical ensembles are in general not equivalent and different phenomena can be 
observed depending on the fact that the controlled variable is extensive or intensive. 
In the following we will often take as a paradigm of intensive ensembles the canonical 
ensemble for which the inverse of the temperature /3^Ms controlled, while the archetype 
of the extensive ensemble will be the microcanonical one for which energy is strictly 
controlled. 



Laplace versus Legendre Transformations 

The relation between the canonical entropy and the logarithm of the partition sum 
is given by a Legendre transform eq.(fT3l). It is important to distinguish between trans- 
formations within the same ensemble, as the Legendre transform, and transformations 



between different ensembles, which are given by non linear integral transforms [|36ll . Let 
us consider energy as the extensive observable and inverse temperature /3 as the conju- 
gated intensive one. The definition of the canonical partition sum is 

Z/3=Iexp(-/3£W), (24) 



where the sum runs over the available eigenstates n of the Hamiltonian. Here, we assume 
that the partition sum converges; this is not always the case as discussed in ref.tMJ- The 



possible divergence of the thermodynamic potential of the intensive ensemble is already 
a known case of ensemble inequivalence! 20l 1641. Computing the canonical (Shannon) 



entropy we get 



Scan{< E>)= \ogZp +/3 < £ >, (25) 

which is an exact Legendre transform since the EoS reads < E >= —dp log Z^g . If energy 
can be treated as a continuous variable, eq.(l24l) can be written as: 



Zp = dEW{E)exp{-l5E), (26) 

where energies are evaluated from the ground state. Eq.(l26l) is a Laplace transform 
between the canonical partition sum and the microcanonical density of states linked to 
the entropy by Se = lnW{E) . If the integrand f{E) = exp(5£- — /3£') is a strongly peaked 
function, it can be approximated by a gaussian (saddle point approximation) so that the 
integral can be replaced by the maximum f{E) times a Gaussian integral. Neglecting 
this factor we get 

Zp^WiE)cxp{-l5E), (27) 

which can be rewritten as 

InZp^SE-pE; (28) 
or introducing the free energy Fj = -^^^InZp 

Ft ^ E-TSe- (29) 

Eq.(l28l) has the structure of an approximate Legendre transform similar to the exact 
espression (l25l) . This shows that in the lowest order saddle point approximation eq.dTTI). 
the ensembles differing at the level of constraints acting on a specific observable (here 
energy) lead to the same entropy, i.e. they are equivalent. We will see in the next 
section that however the saddle point approximation eq.( [27l) can be highly incorrect 
close to a phase transition for the simple reason that the integrand is bimodal making 
a unique saddle point approximation inadequate. In this case eq.(l28l) cannot be applied, 
eq.(l26l) is the only correct transformation between the different ensembles, and ensemble 
inequivalence naturally arises. 

To summarize one should not confuse 

• the link between the thermodynamical potential of the intensive (e.g.log of canon- 
ical partition sum) and of the extensive ensemble (e.g. themicrocanonical entropy) 
which are always related with a Laplace transform.This Laplace transform may 
lead to an approximate Legendre transformationfor normal distributions but we 
know that this Legendre transformation isonly approximate and might be present 
strong deviations if the distributionis abnormal. 

• with the exact Legendre transform between the entropy of theintensive ensemble 
and the corresponding thermodynamical potential. 



This simply corresponds to the fact that the microcanonical and canonicalentropies can 
be very different. 



Ensemble Inequivalence in Phase Transition Regions 

Let us consider the case of a first order phase transition where the canonical energy 
distribution 

Pp^ (E) = W(£)exp(-j8o£)/Z^„ (30) 



has a characteristic bimodal shape |44l 1471 . |48|] at the temperature /3o with two maxima 
E^p \ E^p^ that can be associated with the two phases. It is easy to see that eq.(l26l) can 
also be seen as a Laplace transform of the canonical probability Pp^^ (E) 

Zp=Zp,j^ dEPp^{E)cxp{-{p~l3o)E). (31) 

A single saddle point approximation is not valid when Pj^^ (E) is bimodal; however it is 
always possible to write 

P,=m|,"p;"+mPVf, (32) 

with Pp^ mono-modal normalized probability distribution peaked at E^^ . The canonical 
mean energy is then the weighted average of the two energies 

{E)p=m^^^E^P+m^pEf\ (33) 



with 

m 



f = mf J dEPf {E)E/Ef ^ mf . (34) 

Since only one mean energy is associated with a given temperature j8 ^ the canonical 
caloric curve is monotonous, and the microcanonical one is not. Indeed it is immediate 
to see from eq.(l30l) that the bimodality of Pp implies then a back bending of the 

microcanonical caloric curve = dgS, meaning that in the first order phase transition 
region the two ensembles are not equivalent. If instead of looking at the average {E)p we 

look at the most probable energy Ep , this (unusual)canonical caloric curve is identical 

to the microcanonical one, up to the transition temperature for which the two 
components of Pp (E) have the same height. At this point the most probable energy 
jumps from the low to the high energy branch of the microcanonical caloric curve. 

The question arises whether this violation of ensemble equivalence survives towards 
the thermodynamic limit. This limit can be expressed as the fact that the thermodynamic 
potentials per particle converge when the number of particles goes to infinity : 

fN,p = r'^^ - /i3 ; ^A' (^) = ^ - He) (35) 



where e = E /N. Let us also introduce the reduced probability pf^^ (e) = [P^ {N, E)^ 
which then converges towards an asymptotic distribution 



PN.pie) ^ pp{e) ; pp{e) = exp {s{e) - pe + fp) . (36) 

Since Pp {N, E) [p^ (e))^, one can see that when (e) is normal, the relative energy 
fluctuation in (N, E) is suppressed by a factor 1 / ^/N. At the thermodynamic limit P^ 
reduces to a 5-function and ensemble equivalence is recovered. To analyze the thermo- 
dynamic limit of a bimodal p^^ ^ {e), let us introduce as before the temperature for 
which the two maxima of pi^ ^ (e) have the same height. For a first order phase transition 

j8^J converges to a fixed point j8^^^ as well as the two maximum energies e^-*^ e^^ . 

For all temperatures lower (higher) than j8f^^ only the low (high) energy peak will sur- 
vive at the thermodynamic limit, since the difference of the two maximum probabilities 

will be raised to the power A^. Therefore, below e^^^ and above e^^'* the canonical caloric 
curve coincides with the microcanonical one in the thermodynamic limit. In the canon- 
ical ensemble the temperature j8f^^ corresponds to a discontinuity in the state energy 

irrespectively of the behavior of the entropy between e^^ and e^^\ 

The microcanonical caloric curve in the phase transition region may either converge 
towards the Maxwell construction, or keep a backbending behavior^22'], since a negative 
heat capacity system can be thermodynamically stable even in the thermodynamic limit 
if it is isolated [26] . Examples of a backbending behavior at the thermodynamic limit 
have been reported for a model many-body interaction taken as a functional of the hy- 
pergeometric radius in the analytical work of ref.[3], and for the long range Ising model 
This can be understood as a general effect of long range interactions for which the 
topological anomaly leading to the convex intruder in the entropy is not cured by in- 
creasing the number of particles lU, |65l]. Conversely, for short range interactions [15] 



the backbending is a surface effect which should disappear at the thermodynamic limit. 
This is the case for the Potts model[33], the microcanonical model of fragmentation 



of atomic clusters [|66|1 and for the lattice gas model with fluctuating volume 14911 . The 
interphase surface entropy goes to zero d& N ^ m these models, leading to a linear 
increase of the entropy in agreement with the canonical predictions. Within the approach 
based on the topology of the probability distribution of observables it was shown 
that ensemble inequivalence arises from fluctuations of the order parameter ll23ll . En- 
sembles putting different constraints on the fluctuations of the order parameter lead to a 
different thermodynamics. In the case of phase transitions with a finite latent heat, the 
total energy usually plays the role of an order parameter except in the microcanonical 
ensemble which therefore is expected to present a different thermodynamics than the 
other ensembles i20|] . This inequivalence may remain at the thermodynamic limit if the 
involved phenomena are not reduced to short range effects. 




Energy (o 

FIGURE 4. Left panels: temperature as a function of the potential energy E2 (full lines) and of the 
kinetic energy E — E2 (dot-dashed lines)for two model equation of states of classical systems showing 
a first order phase transition. Symbols: temperatures extracted from the most probable kinetic energy 
thermometer from eg. (1381). Right panels: total caloric curves (symbols) corresponding to the left panels 
and thermodynamic limit of eq.(l42t (dashed lines). 

Temperature jump at constant energy 

In particular, it may happen that the energy of a subsystem becomes an order 
parameter when the total energy is constrained by a conservation law or a micro- 
canonical sorting. This frequently occurs for Hamiltonians containing a kinetic energy 
contribution Jsl 0, if the kinetic heat capacity is large enough, it becomes an or- 
der parameter in the microcanonical ensemble. Then, the microcanonical caloric curve 
presents at the thermodynamical limit a temperature jump in complete disagreement 
with the canonical ensemble. 

To understand this phenomenon, let us consider a finite system for which the Hamil- 
tonian can be separated into two components E = Ei + E2, that are statistically inde- 
pendent {W{Ei,E2) = Wi{E[)W2{E2)) and such that the associated degrees of freedom 
scale in the same way with the number of particles; we will also consider the case where 
Si = logWi has no anomaly while ^2 = log W2 presents a convex intruder lfisll which is 
preserved at the thermodynamic limit. Typical examples of Ei are given by the kinetic 
energy for a classical system with velocity independent interactions, or other similar one 
body operators [4]. The probability to get a partial energy Ei when the total energy is E 
is given by 



(£1 ) = exp (5i (£1 ) + 52 (£ - £1 ) - 5 (£) ) 



(37) 



The extremum of Pe {E\ ) is obtained for the partitioning of the total energy E between 
the kinetic and potential components that equalizes the two partial temperatures 



Ti =dEMEi) = dEME-Ei) = T2 . (38) 

If E\is unique, Pe {E\) is mono-modal and we can use a saddle point approximation 
around this solution to compute the entropy 

5(£)=log r J£iexp(5i(£i)+52(£-£i)). (39) 

J — oo 

At the lowest order, the entropy is simply additive so that the microcanonical tempera- 
ture of the global system dES{E) = T ^ is the one of the most probable energy partition. 
Therefore, the most probable partial energy Ei acts as a microcanonical thermometer. If 
Ei is always unique, the kinetic thermometer in the backbending region will follow the 
whole decrease of temperature as the total energy increases. Therefore, the total caloric 
curve will present the same anomaly as the potential one. If conversely the partial en- 



ergy distribution is double humped 116811 . then the equality of the partial temperatures 



admits three solutions, one of them E^^^ being a minimum. At this point the partial heat 



'1 

capacities 



Cfi = -T^dlSiiEf^) ; C^' = -T^dlS2iE-Ef^) (40) 



fulfill the relation 



cr^+C7^<o (41) 



This happens when the potential heat capacity is negative and the kinetic energy is 
large enough (Ci > — C2) to act as an approximate heat bath: the partial energy dis- 
tribution Pe{E\) in the microcanonical ensemble is then bimodal as the total energy 
distribution P^ (E) in the canonical ensemble, implying that the kinetic energy is the 
order parameter of the transition in the microcanonical ensemble. In this case the micro- 
canonical temperature is given by a weighted average of the two estimations from the 
two maxima of the kinetic energy distribution 

T = dESiE) = ^ " ' (42) 

where T^'^ = Ti (£'['■') are the kinetic temperatures calculated at the two maxima, 

P^''' = Pe{E^i) are the probabilities of the two peaks and CT^'^ their widths. At the ther- 
modynamic limit eq. (|4T)) reads +C2 ^ < 0, with c = \\m^^ooC/N . If this condi- 
tion is fulfilled, the probability distribution Pp{E) presents two maxima for all finite 
sizes and only the highest peak survives at A'^ = 00 . Let Et be the energy at which 

Pei{E^^^) = Pe,{e'^'^^). Because of eq. (l42l) . at the thermodynamic limit the caloric curve 
will follow the high (low) energy maximum of Pe {Ei) for all energies below (above) Ef, 
there will be a temperature jump at the transition energy Et. Let us illustrate the above 




FIGURE 5. Canonical event distributions in the potential versus kinetic energy plane (left panels) and 
total versus kinetic energy plane (right panels) at the transition temperature for the two model equations 
of state of figure|4] The inserts show two constant total energy cuts of the distributions. 



results with two examples for a classical gas of interacting particles. For the kinetic en- 
ergy contribution we have Si{E) = ciln{E /N)^ with a constant kinetic heat capacity 
perparticle ci =3/2. For the potential part we will take two polynomial parametrization 
of the interaction caloric curve presenting a back bending which are displayed in the left 
part of figure HI in units of an arbitrary scale e. If the decrease of the partial temperature 
T2{E2) is steeper than —2/3 (figure HJi ) [3] eq. (|4TI) is verified and the kinetic caloric 
curve Ti{E — Ei) (dot-dashed line) crosses the potential one T2{E2) (full line) in three 
different points for all values of the total energy lying inside the region of coexistence of 
two kinetic energy maxima. The resulting caloric curve for the whole system is shown 
in figure (symbols) together with the thermodynamic limit (lines) evaluated from the 
double saddle point approximation (|4]). In this case one observes a temperature jump at 
the transition energy. If the temperature decrease is smoother (figure St) the shape of the 
interaction caloric curve is preserved at the thermodynamic limit (figure |4]1). 

The occurrence of a temperature jump in the thermodynamic limit is easily spot- 
ted by looking at the bidimensional canonical event distribution Pp{Ei,E2) shown at 
the transition temperature /3 = /3f in the left part of figure \5\ for the two model equa- 
tion of states of figure IH In the canonical ensemble the kinetic energy distribution is 
normal. These same distributions are shown as a function of E and Ei, Pp{E,E\) oc 
exp5i(£'i)exp52(£' — £'i)exp(— /3£') in the right part of figure |5l The microcanonical 
ensemble is a constant energy cut of Pp{E,Ei), which leads to the microcanonical dis- 



tribution Pe{E[) within a normalization constant. If the anomaly in the potential equation 
of state is sufficiently important, the distortion of events due to the coordinate change is 
such that one can still see the two phases coexist even after a sorting in energy. 



DEFINITIONS OF PHASE TRANSITIONS IN FINITE SYSTEMS 

Phase transitions are universal properties of matter in interaction. In macroscopic 
physics, they are singularities (i.e. non-analitical behaviors) in the system equation of 
state (EoS) and hence classified according to the degree of non-analyticity of the EoS 
at the transition point. Then, a phase transition is an intrinsic property of the system 
and not of the statistical ensemble used to describe the equilibrium. Indeed, at the 
thermodynamic limit all the possible statistical ensembles converge towards the same 
EoS and the various thermodynamic potentials are related by simple Legendre trans- 
formations leading to a unique thermodynamics. On the other side for finite systems, 
as discussed above, two ensembles which put different constraints on the fluctuations 
of the orderparameter lead to qualitatively different EoS close to a first order phase 
transition [25', Thermodynamic observables like heat capacities can therefore be 
completely different depending on the experimental conditions of the measurement. 
Moreover, such inequivalences may survive at the thermodynamic limit if forces are 
long ranged as for self gravitating objects[[3|, 3]. In fact the characteristic of phase 
transitions in finite systems, and in particular the occurrence of a negative heat capacity. 



have first been discussed in the astrophysical context[2, 31, 26, 69, El iti a. 
Since these pioneering works in astrophysics, an abundant literature is focused on 
the understanding of phase transition in small systems from a general point of view 
H H H H S & S S H or in the mean-field context [4^, ,8fl] or for some 



specific systems such as metallic clusters [|48,. .68,1 or nuclei [ . 81.1 and even DNA [82] . 



Phase transitions in infinite systems 

Let us first recall the definition of phase transitions in infinite systems. At the thermo- 
dynamic limit for short range interactions the statistical ensembles are equivalent and it 
is enough to reduce the discussion to the ensemble where only one extensive variable 
is kept fixed, all the others being constrained through the associated Lagrange pa- 
rameters. The typical example is the grandcanonical ensemble where only the volume 
Ai = V is kept as an extensive variable. Then all the thermodynamics is contained in the 
associated potential log Z^^ (^l)- Since it is extensive, the potential is proportional 
to the remaining extensive variable 

logZ;t, =AMXi,..,Xl-i) (43) 

so that all the non trivial thermodynamic properties are included in the reduced potential 
i.e. the intensive variable 



= logZ,, (A,) = (44) 

associated with A^. In the grand canonical case = V, the reduced potential is the pres- 
sure, Xt oc p, which is then a function of the temperature and the chemical potential(s). 
In this limit all the thermodynamics is included in the single function A^-i), 
and this is why in the literature p{V) is often loosely refered to as '"the"' EoS, and the 
existence of many EoS is ignored. If this EoS is analytical, all the thermodynamic quan- 
tities which are all derivatives of the thermodynamic potential, present smooth behav- 
iors, and no phase transition appears. A phase transition is a major modification of the 
macrostate properties for a small modification of the control parameters (Ai,..., 
Such an anomalous behavior can only happen if the thermodynamic potential presents 
a singularity. This singularity can be classified according to the order of the derivative 
which presents a discontinuity or a divergence. According to Ehrenfest this is the order 
of the phase transition. In modem statistical mechanics, all the higher order are called 
under the generic name of continuous transitions. Figure [6] schematically illustrates a 
first order phase transition in the canonical ensemble. 



Phase transitions in finite systems 

As soon as one considers a finite physical system, all the above discussion does not 
apply. First the thermodynamic potential and observables are not additive, therefore we 
cannot introduce a reduced potential and we get 



'^log^Ai,...A-iW 

<5Ai 

logZAi....A-i(^^) 



(45) 



i.e. the grand potential per unit volume does not give the pressure any more, and 
presents a non-trivial volume dependence. Moreover the analysis of the singularities 
of the thermodynamic potential has no meaning, since it is an analytical function. The 
standard statistical physics textbooks thus conclude that rigourously speaking there is 
no phase transitions in finite systems. However, as we have already mentioned, first 
for self-gravitating objects jB 26, [sl, [6^, [v^, IzE [v^, IvS] and then in small systems 



i, H [30, 48, 3, H S HMIHI] it was shown that phase transitions might be 
associated with the occurrence of negative microcanonical heat capacities. This can be 
generalized to the occurence of an inverted curvature of the thermodynamic potential 
of any ensemble keeping at least one extensive variable A^ not orthogonal to the order 
parameter 0, [ssl]. In the following we call this ensemble an extensive ensemble. 



orthogonality is here defined using the trace as a scalar product between observables following section 




6q"^ Temperature 

FIGURE 6. Schematic representation of a first order phase transition in the canonical case. Top: the log 
of the canonical partition sum (i.e. the free energy) presents an angular point Bottom: the first derivative 
as a function of the temperature (i.e. the energy) presents a jump. 



Then, negative compressibility or negative susceptibility should be, like negative heat 
capacity, observed in first order phase transitions of finite systems. In the microcanonical 
ensemble of classical particles, it was proposed that anomalously large fluctuations of 
the kinetic energy, i.e. larger then the the expected canonical value, highlight a negative 
heat capacity [|84ll . It was then demonstrated that those two signals of a phase transition, 
negative curvatures and anomalous fluctuations, observed in extensive ensemble where 
the order parameter is fixed, are directly related to the appearance of bimodalities 
in the distribution of this order parameter in the intensive ensemble where the order 
parameter is only fixed in average through its conjugated Lagrange multiplier pii 
The occurrence of bimodalities is discussed in the literature since a long time and is 
often used as a practical way to look for phase transition in numerical simulations 
[[l7ll47|] : however, the general equivalence between negative curvatures and bimodalities 
was presented in ref. [|44ll . For intensive ensembles, since the pioneering work of Yang 
and Lee [|43ll another definition was proposed considering the zeroes of the partition sum 




FIGURE 7. Schematic representation of the zeroes of the partition sum Z in the complexe temperature 
plane. The regions where no zeroes are coming close to the real axis when the thermodynamic limit is 
taken will not present singularities of logZ. 



in the complex intensive parameter plane 11431 18511 . The idea is simple: the zeroes of 
Z are the singularities of logZ and so phase transitions, which are singularities, must 
come from the zeroes of the partition sum. In a finite system the zeroes of the partition 
sum cannot be on the real axis since the partition sum Z is the sum of exponential 
factors which cannot produce a singularity of logZ. However, the thermodynamic limit 
of an infinite volume may bring the singularity on the real axis. This is schematically 
illustrated in fig. U\ Only regions where zeroes converge towards the real axis may 
present phase transitions, while the other regions present no anomalies. The order of 
the transition can be associated to the asymptotic behavior of zeroes[85]. 

The distribution of zeroes has been analyzed in ref.[45] where the transition was 
studied with a parabolic entropy. In ref. [46] the equivalence of the expected behavior 
of the zeroes in a first order phase transition case and the occurrence of bimodalities in 
the distribution of the associated extensive parameter was demonstrated. To be precise, 
in this demonstration bimodality means that the extensive variable distribution can be 
splitted at the transition point into two distributions of equal height, with the distance 
between the two maxima scaling like the system size[f86l]. 

This global picture of phase transitions in finite systems is summarized in figure [8] 
in the case where energy is the order parameter of the transition. The occurrence of a 
bimodal distribution of the extensive parameter (e.g. energy) in the associated intensive 
(e.g. canonical) ensemble is a necessary and sufficient condition to asymptotically get 
the distribution of the Yang-Lee zeroes in the complex lagrange multiplier (e.g. temper- 
ature) plane, which is expected in a first order transition. The direction of bimodality is 
the direction of the order parameter. This bimodality is also equivalent to the presence of 
an anomalous curvature in the thermodynamic potential of the extensive (microcanon- 
ical) ensemble obtained constraining the bimodal observable to a fixed value. In the 
extensive ensemble, the inverted curvature can be spotted looking for anomalously large 
fluctuations (e.g. larger then the canonical ones) of the partition of the extensive variable 
(e.g. energy) between two independent subsystems. 




FIGURE 8. Schematic representation of the different equivalent definitions of first order phase tran- 
sitions in finite systems. From top to bottom: the partition sum's zeroes aligning perpendicular to the 
real temperature axis with a density scaling like the number of particles; the bimodality of the energy 
distribution with a distance between the two maxima scaling like the number of particles times the latent 
heat; the appearence of a back-bending in the microcanonical caloric curve i.e. a negative heat capacity 
region; and the observation of anomalously large fluctuations of the energy splitting between the kinetic 
part and the interaction part. 



STATISTICAL DESCRIPTION OF EVOLVING SYSTEMS 



A major issue in the statistical treatment of finite systems is that most of the time open 
and transient systems are studied. Therefore, they are not only finite in size but also finite 



in time, and in fact they are evolving. The number of degrees of freedom of a quantum 
many-body problem being infinite, it is impossible to have all the information needed to 
solve exactly the dynamical problem. Since only a small part of the observation space 
is relevant, this time evolution may also be treated with statistical tools. This is the 
purpose of many models: from Langevin approaches to Fokker-Planck equations, from 
hydrodynamics to stochastic Time Dependent Hartree Fock theory. The purpose of this 
chapter is not to review those theoretical approaches, therefore we will not enter here into 
details about the different recent progresses, and we will rather focus this discussion on 
general arguments of time dependent statistical ensembles f52L[55ll. 

A statistical treatment of a dynamical process is based on the idea that at any time 
one can consider only the relevant variables A(, disregarding all the other ones a,n as 
irrelevant. If only the maximum entropy state is followed in time assuming that all 
the irrelevant degrees of freedom have relaxed instantaneously, one gets a generalized 
mean-field approach [61]. If the fluctuations of the irrelevant degrees of freedom are 
included, this leads to a Langevin dynamics[62]. With those considerations one can see 
that statistical approaches can be always improved including more and more degrees of 
freedom to asymptotically become exact. However, before including a huge number 
of degrees of freedom one should ask himself if only a few observables can take 
care of the most important dynamical aspects of the systems we are looking at. In 



a recent paper HSSN it was proposed to introduce observations at different times (e.g. 
different freeze-out/equilibration times) as well as time odd extensive parameters. The 
idea is simple : maximizing the Shannon entropy with different observables A( known 
at different times t( = to + At£ is a way to treat a part of the dynamics. Going to the 
Heisenberg representation, if we propagate all the to the same time to we get: 

Aeito) = e-'^'^^A(e'^'^^ (46) 
= Ae-iAte[H,Ae]+... (47) 

This shows that the time propagation introduces new constraining operators 

Be = -i[H,Ae]. (48) 

If is a time even observable, Bi is a time odd operator. Let us take the example of an 
unconfined finite system characterized at a given time by a typical size < >=< S >, 
where is the one body operator Y.i If the whole information is assumed to be 
known at the same time, then the statistical distribution of event reads in a classical 
canonical picture 

which is formally equivalent to a particle in a harmonic potential. However, if now we 
assume that the size information is coming from a different time, then according to Eq. 
(|48l) we must introduce a new time odd operator = —i[H., f']. For a local interaction, 
this reduces to 



Vr = [rp + pr)/m 



(50) 



which represents a radial flow. Then the classical canonical probability reads 



pin) = lg-)8(p(")-A(0r("')'-A5(«) (5^^ 

z 

which is a statistical ensemble of particles under a Hubblian flow. In the ideal gas 
model eq. (|5TI) provides the exact solution at any time of the dynamics. This simple 
example shows that information theory allows treat in a statisticl picture dynamical pro- 
cesses where observables are defined at different times, by taking into account time odd 
components such as flows. This might be a tool to extract thermodynamical quantities 
from complex dynamics. In particular, the above example shows that in an open system 
an initial extension in space is always transformed into an expansion, meaning that flow 
is an essential ingredient even in statistical approaches. 



CONCLUSION 

In conclusion, we have presented in this paper the actual understanding of the thermody- 
namics of finite systems from the point of view of information theory. We have put some 
emphasis on first order phase transitions which are associated to specific and intriguing 
phenomena as bimodalities and negative heat capacities. Phase transitions have been 
widely studied in the thermodynamic limit of infinite systems. However, in the physical 
situations considered here (i.e. small systems or non saturating forces), this limit cannot 
be taken and phase transitions should be reconsidered from a more general point of view. 
This is for example the case of matter under long range forces like gravitation. Even if 
these self gravitating systems are very large they cannot be considered as infinite be- 
cause of the non saturating nature of the force. Other cases are provided by microscopic 
or mesoscopic systems built out of matter which is known to present phase transitions. 
Metallic clusters can melt before being vaporized. Quantum fluids may undergo Bose 
condensation or a super-fluid phase transition. Dense hadronic matter should merge in a 
quark and gluon plasma phase while nuclei are expected to exhibit a liquid -gas phase 
transition and a superfluid phase. For all these systems the theoretical and experimental 
issue is how to define and sign a possible phase transition in a finite system. In this re- 
view we have presented the synthesis of different works which tend to show that phase 
transitions can be defined as clearly as in the thermodynamic limit. Depending upon the 
statistical ensemble, i.e. on the experimental situation, one should look for different sig- 
nals. In the ensemble where the order parameter is free to fluctuate (intensive ensemble), 
the topology of the event distribution should be studied. A bimodal distribution signals a 
first-order phase transition. The direction in the observable space in which the distribu- 
tion is bimodal defines the best order parameter. To survive the thermodynamic limit, the 
distance between the two distributions, the two " 'phases'", should scale like the number 
of particles. This occurrence of a bimodal distribution is equivalent to the alignment of 
the partition sum zeroes as described by the Yang and Lee theorem. In the associated 
extensive ensemble, the bimodality condition is also equivalent to the requirement of 
a convexity anomaly in the thermodynamic potential. The first experimental evidences 
of such a phenomenon have been reported recently in different fields: the melting of 
sodium clusters [6],the fragmentation of hydrogen clusters[8], the pairing in nuclei|@] 



and nuclear multifragmentation 10, 87, 88]. However, much more experimental and the- 
oretical studies are now expected to progress in this new field of phase transitions in 
finite systems. Three challenges can thus been assigned to the physics community: 

• The statistical description of non extensive systems and in particular of open tran- 
sient finite systems; 

• The experimental and theoretical study of phase transitions in thoses systems and 
of the expected abnormal thermodynamics 

• The confirmation of the observation of the phase transition and the analysis of the 
associated equation of state properties and the associated phase diagram. 
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